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Abstract. We consider the space of complete and separable metric 
spaces which are equipped with a probability measure. A notion of 
convergence is given based on the philosophy that a sequence of metric 
measure spaces converges if and only if all finite subspaces sampled from 
these spaces converge. This topology is metrized following Gromov's 
idea of embedding two metric spaces isometrically into a common metric 
space combined with the Prohorov metric between probability measures 
on a fixed metric space. We show that for this topology convergence in 
distribution follows - provided the sequence is tight - from convergence 
of all randomly sampled finite subspaces. We give a characterization of 
tightness based on quantities which are reasonably easy to calculate. 

Subspaces of particular interest are the space of real trees and of ultra- 
metric spaces equipped with a probability measure. As an example we 
characterize convergence in distribution for the (ultra-)metric measure 
spaces given by the random genealogies of the A-coalescents. We show 
that the A-coalescent defines an infinite (random) metric measure space 
if and only if the so-called "dust-free" -property holds. 



In this paper we study random metric measure spaces which appear fre- 
quently in the form of random trees in probabihty theory. Prominent ex- 
amples are random binary search trees as a special case of random recur- 
sive trees ([DH05]), ultra- metric structures in spin-glasses (see, for exam- 




ple, |BK0 6'. 'MPVSTj). and coalescent processes in population genetics (for 
example, [HudOO; IEvaOO| ) . Of special interest is the continuum random 
tree, introduced in |Ald93j . which is related to several objects, for example, 
Galton- Watson trees, spanning trees and Brownian excursions. 

Moreover, examples for Markov chains with values in finite trees are the 
Aldous-Broder Markov chain which is related to spanning trees ( |Ald90j ). 
growing Galton- Watson trees, and tree-bisection and reconnection which is 
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a method to search through tree space in phylogenetic reconstruction (see 
e.g., [MPS]). 

Because of the exponential growth of the state space with an increasing 
number of vertices tree- valued Markov chains are - even so easy to construct 
by standard theory - hard to analyze for their qualitative properties. It 
therefore seems to be reasonable to pass to a continuum limit and to con- 
struct certain limit dynamics and study them with methods from stochastic 
analysis. 

We will apply this approach in the forthcoming paper [GPW07] to trees 
encoding genealogical relationships in exchangeable models of populations 
of constant size. The result will be the tree-valued Fleming- Viot dynamics. 
For this purpose it is necessary to develop systematically the topological 
properties of the state space and the corresponding convergence in distribu- 
tion. The present paper focuses on these topological properties. 

As one passes from finite trees to "infinite" trees the necessity arises to 
equip the tree with a probability measure which allows to sample typical 
finite subtrees. In |Ald93j . Aldous discusses a notion of convergence in dis- 
tribution of a "consistent" family of finite random trees towards a certain 
limit: the continuum random tree. In order to define convergence Aldous 
codes trees as separable and complete metric spaces satisfying some special 
properties for the metric characterizing them as trees which are embedded 
into if and equipped with a probability measure. In this setting finite trees, 
i.e., trees with finitely many leaves, are always equipped with the uniform 
distribution on the set of leaves. The idea of convergence in distribution of 
a "consistent" family of finite random trees follows Kolmogorov's theorem 
which gives the characterization of convergence of R-indexed stochastic pro- 
cesses with regular paths. That is, a sequence has a unique limit provided 
a tightness condition holds on path space and assuming that the "finite- 
dimensional distributions" converge. The analogs of finite-dimensional dis- 
tributions are "subtrees spanned by finitely many randomly chosen leaves" 
and the tightness criterion is built on characterizations of compact sets in 
it. 

Aldous's notion of convergence has been successful for the purpose of 
rescaling a given family of trees and showing convergence in distribution 
towards a specific limit random tree. For example, Aldous shows that 
suitably rescaled families of critical finite variance offspring distribution 
Galton- Watson trees conditioned to have total population size converge 
as —s- CO to the Brownian continuum random tree, i.e., the M-tree as- 
sociated with a Brownian excursion. Furthermore, Aldous constructs the 
genealogical tree of a resampling population as a metric measure space asso- 
ciated with the Kingman coalescent, as the limit of A^-coalescent trees with 
weight 1/A^ on each of their leaves. 

However, Aldous's ansatz to view trees as closed subsets of if , and thereby 
using a very particular embedding for the construction of the topology. 
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seemed not quite easy and elegant to work with once one wants to con- 
struct tree- valued limit dynamics (see, for example, |EPW06j . |EW06j and 
[GPW07] ) ■ More recently, isometry classes of M-trees, i.e., a particular class 
of metric spaces, were introduced, and a means of measuring the distance 
between two (isometry classes of) metric spaces were provided based on an 
"optimal matching" of the two spaces yielding the Gromov-Hausdorff metric 
(see, for example. Chapter 7 in [ BBIOl] ). 

The main emphasis of the present paper is to exploit Aldous's philosophy 
of convergence without using Aldous's particular embedding. That is, we 
equip the space of separable and complete real trees which are equipped 
with a probability measure with the following topology: 

• A sequence of trees (equipped with a probability measure) converges 
to a limit tree (equipped with a probability measure) if and only if 
all randomly sampled finite subtrees converge to the corresponding 
limit subtrees. The resulting topology is referred to as the Gromov- 
weak topology (compare Definition 12. Sp . 

Since the construction of the topology works not only for tree-like metric 
spaces, but also for the space of (measure preserving isometry classes of) 
metric measure spaces we formulate everything within this framework. 

• We will see that the Gromov-weak topology on the space of metric 
measure spaces is Polish (Theorem [1]) . 

In fact, we metrize the space of metric measure spaces equipped with the 
Gromov-weak topology by the Gromov-Prohorov metric which combines the 
two concepts of metrizing the space of metric spaces and the space of proba- 
bility measures on a given metric space in a straightforward way. Moreover, 
we present a number of equivalent metrics which might be useful in different 
contexts. 

This then allows to discuss convergence of random variables taking values 
in that space. 

• We next characterize compact sets (Theorem [2] combined with The- 
orem [5]) and tightness (Theorem [3] combined with Theorem [5]) via 
quantities which are reasonably easy to compute. 

• We then illustrate with the example of the A-coalescent tree (Theo- 
rem |4]) how the tightness characterization can be applied. 

We remark that topologies on metric measure spaces are considered in 
detail in Section 3^ of [Gro99j . We are aware that several of our results (in 
particular, Theorems [U [2] and [5|) are stated in \Gm99\ in a different set-up. 
While Gromov focuses on geometric aspects, we provide the tools necessary 
to do probability theory on the space of metric measure spaces. See Remark 
15.31 for more details on the connection to Gromov's work. 

Further related topologies on particular subspaces of isometry classes 
of complete and separable metric spaces have already been considered in 
[StuOGj and [EW06] . Convergence in these two topologies implies conver- 
gence in the Gromov-weak topology but not vice versa. 
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Outline. The rest of the paper is organized as follows. In the next two 
sections we formulate the main results. In Section [2] we introduce the space 
of metric measure spaces equipped with the Gromov-weak topology and 
characterize their compact sets. In Section [3] we discuss convergence in 
distribution and characterize tightness. We then illustrate the main results 
introduced so far with the example of the metric measure tree associated 
with genealogies generated by the infinite A-coalescent in Section |4l 

Sections [5] through [9] are devoted to the proofs of the theorems. In Sec- 
tion [5] we introduce the Gromov-Prohorov metric as a candidate for a com- 
plete metric which generates the Gromov-weak topology and show that the 
generated topology is separable. As a technical preparation we collect re- 
sults on the modulus of mass distribution and the distance distribution (see 
Definition l2.9p in Section [6l In Sections [7] and [8] we give characterizations on 
pre-compactness and tightness for the topology generated by the Gromov- 
Prohorov metric. In Section [9] we prove that the topology generated by the 
Gromov-Prohorov metric coincides with the Gromov-weak topology. 

Finally, in Section [10] we provide several other metrics that generate the 
Gromov-weak topology. 



As usual, given a topological space {X, O), we denote by A^i(X) the space 
of all probability measures on X equipped with the Borel-cr-algebra B{X). 
Recall that the support of ^, supp(;u), is the smallest closed set Xq C X such 
that ^{X\Xq) = 0. The push forward of fi under a measurable map ip from 
X into another metric space {Z, rz) is the probability measure (p^fi G Mi{Z) 
defined by 



for all A G B{Z). Weak convergence in Aii{X) is denoted by 

In the following we focus on complete and separable metric spaces. 

Definition 2.1 (Metric measure space). A metric measure space is a com- 
plete and separable metric space {X, r) which is equipped with a probability 
measure ^ G M.i{X). We write M for the space of measure-preserving isom- 
etry classes of complete and separable metric measure spaces, where we say 
that (X, r, /u) and (X',r',/x') are measure-preserving isometric if there ex- 
ists an isometry between the supports of jj, on (X, r) and of ^' on (X',r') 
such that /u' = It is clear that the property of being measure-preserving 
isometric is an equivalence relation. 

We abbreviate X = (X, r, //) for a whole isometry class of metric spaces 
whenever no confusion seems to be possible. 



2. Metric measure spaces 



(2.1) 



99,//(A) :=/i(v9 \A)) 



Remark 2.2. 
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(i) Metric measure spaces, or short mm-spaces, are discussed in |Gro99] 
in detail. Therefore they are sometimes also referred to as Gromov 
metric triples (see, for example, |Ver98] ) . 

(ii) We have to be careful to deal with sets in the sense of the Zermelo- 
Fraenkel axioms. The reason is that we will show in Theorem [T] that 
M can be metrized, say by d, such that (M, d) is complete and sep- 
arable. Hence if P G 7Wi(M) then the measure preserving isometry 
class represented by (M, d,P) yields an element in M. The way out 
is to define M as the space of measure preserving isometry classes 
of those metric spaces equipped with a probability measure whose 
elements are not themselves metric spaces. Using this restriction we 
avoid the usual pitfalls which lead to Russell's antinomy. □ 

To be in a position to formalize that for a sequence of metric measure 
spaces all finite subspaces sampled by the measures sitting on the corre- 
sponding metric spaces converge we next introduce the algebra of polyno- 
mials on M. 

Definition 2.3 (Polynomials). A function $ = ^^'"^ : M — > M zs called a 
polynomial (of degree n with respect to the test function 0) on M if and only 
ifnGNis the mimimal number such that there exists a bounded continuous 
function (p : [0, oo M such that 

(2.2) <^{{X,r,fi)) = y^«"(d(xi,...,x„))</.((r(x,,x,)) 

where fi'^"' is the n-fold product measure of fi. Denote by 11 the algebra of 
all polynomials on M. 



Example 2.4. In future work, we are particularly interested in tree-like 
metric spaces, i.e., ultra-metric spaces and M-trees. In this setting, func- 
tions of the form (12. 2p can be, for example, the mean total length or the 
averaged diameter of the sub-tree spanned by n points sampled indepen- 
dently according to fi from the underlying tree. □ 

The next example illustrates that one can, of course, not separate metric 
measure spaces by polynomials of degree 2 only. 

Example 2.5. Consider the following two metric measure spaces. 




X y 



Assume that in both spaces the mutual distances between different points 
are 1. In both cases, the empirical distribution of the distances between two 
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points equals ^Sq + |(5i, and hence ah polynomials of degree n = 2 agree. 
But obviously, X and y are not measure preserving isometric. □ 

The first key observation is that the algebra of polynomials is a rich 
enough subclass to determine a metric measure space. 

Proposition 2.6 (Polynomials separate points). The algebra H of poly- 
nomials separates points in M. 

We need the useful notion of the distance matrix distribution. 

Definition 2.7 (Distance matrix distribution). Let X = {X,r,fi) G M and 
the space of infinite (pseudo-) distance matrices 

(2.3) M™^* := {(rij)i<i<j<oo : + rjk > Vik, V 1 < i < j < < oo}. 
Define the map : ^ W''^ by 

(2.4) L^{xi,X2,..) := {r{xi,Xj))^^.^.^^, 
and the distance matrix distribution of X by 

(2.5) := (.^)./i®^. 

Note that for ;f e M and of the form ([2^2]) , we have that 

(2.6) HX)= j y^{<^{r,J)l<i<j)<p[{rij)l<i<J<n)■ 
Proof of Proposition\2^ Let Xn = (X£,r£, € M, i = 1,2, and assume 
that ^{Xi) = <^>{X2), for all $ e H. The algebra {(p G Cb(M(2)); n G N} 
is separating in A4i{W^^^) and so u"^^ = v"^"^ by (j2.6p . Applying Gromov's 
Reconstruction theorem for mm-spaces (see Paragraph 3^.5 in |Gro99j ). we 
find that Xi = X2. □ 

We are now in a position to define the Gromov-weak topology. 

Definition 2.8 (Gromov-weak topology). A sequence {Xn)n&i is said to 
converge Gromov-weakly to X in M if and only if ^[Xn) converges to ^{X) 
in M, for all polynomials <I> G 11. We call the corresponding topology Om. on 
M the Gromov-weak topology. 

The following result ensures that the state space is suitable to do proba- 
bility theory on it. 

Theorem 1. The space (M, Cm) is Polish. 

In order to obtain later tightness criteria for laws of random elements in 
M we need a characterization of the compact sets of (M, Om)- Informally, a 
subset of M will turn out to be pre-compact iff the corresponding sequence 
of probability measures put most of their mass on subspaces of a uniformly 
bounded diameter, and if the contribution of points which do not carry much 
mass in their vicinity is small. 

These two criteria lead to the following definitions. 
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Definition 2.9 (Distance distribution and Modulus of mass distribution). 

Let X = {X,r,fi) G M. 

(i) The distance distribution, which is an element in 7V4i([0, oo)), is 
given by wx '■= r^pi®'^, i.e., 

(2.7) wx{■)■.= ^i''\{x.x')■.r{x,x')e^]. 

(ii) For (5 > 0, define the modulus of mass distribution as 

(2.8) vs{X) := inf |e > : /ijx G X : fMiBe{x)) < 5} < e} 
where ^^(x) is the open ball with radius e and center x. 

Remark 2.10. Observe that wx and are well-defined because they are 
constant on isometry classes of a given metric measure space. 

The next result characterizes pre-compactness in (M, Cm)- 

Theorem 2 (Characterization of pre-compactness). A set T C M is pre- 
compact in the Gromov-weak topology if and only if the following hold. 

(i) The family {wx '■ X €T} is tight. 

(ii) For all e > there exist a 5 = 6(e) > such that 

(2.9) supvs{X)<e. 

Remark 2.11. If F = {Xi, X2, ...} then we can replace sup by limsup in 
([2:9]) . □ 

Example 2.12. In the following we illustrate the two requirements for a 
family in M to be pre-compact which are given in Theorem[2]by two counter- 
examples. 

(i) Consider the isometry classes of the metric measure spaces X^ := 
({1, 2}, r„(l, 2) = n,/in{l} = /in{2} = ^). A potential limit object 
would be a metric space with masses ^ within distance infinity. This 
clearly does not exist. 

Indeed, the family {wx^ = ^Sq + n E N} is not tight, and 
hence {X^, n G N} is not pre-compact in M by Condition (i) of 
Theorem [2l 

(ii) Consider the isometry classes of the metric measure spaces Xn = 
iXn,rn, iJ-n) given for n G N by 

2" 

(2.10) X„ :={!,.. .,2"}, r„(x,y) :=l{x^y}, /i„ := 2"" 5^, 

i=l 

i.e., Xn consists of 2" points of mutual distance 1 and is equipped 
with a uniform measure on all points. 
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(2.111 



1 

2 



1 

4 

X2 




A potential limit object would consist of infinitely many points of 
mutual distance 1 with a uniform measure. Such a space does not 
exist. 

Indeed, notice that for (5 > 0, 



SO sup„gj^ W5(<Y„) = 1, for all 5 > 0. Hence {Xn, n € N} does not 
fulfil Condition (ii) of Theorem [21 and is therefore not pre-compact. 
□ 




3. Distributions of random metric measure spaces 

Prom Theorem [1] and Definition 12.81 we immediately conclude the char- 
acterization of weak convergence for a sequence of probability measures on 
M. 

Corollary 3.1 (Characterization of weak convergence). A sequence (P„)„gN 
in A4i(M) converges weakly w.r.t. the Gromov-weak topology if and only if 

(i) the family {Pn; n € N} is relatively compact in A4i(M), and 

(ii) for all polynomials <I> € 11, (P„[$])„gN converges in M. 

Proof. The "only if" direction is clear, as polynomials are bounded and 
continuous functions by definition. To see the converse, recall from Lemma 
3.4.3 in [EK86| that given a relative compact sequence of probability mea- 
sures, each separating family of bounded continuous functions is convergence 
determining. □ 

While Condition (ii) of the characterization of convergence given in Corol- 
lary [3TT] can be checked in particular examples, we still need a manageable 
characterization of tightness on A4i(M) which we can conclude from Theo- 
rem [2j It will be given in terms of the distance distribution and the modulus 
of mass distribution. 

Theorem 3 (Characterization of tightness). A set A C A4i{M.) is tight if 
and only if the following holds: 
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(i) The family {¥[wx] :¥ £ A} is tight in Mi{R). 

(ii) For all e > there exist a 6 = 6(e) > such that 

(3.1) supF[vs{X)] < £. 

PeA 

Remark 3.2. (i) Using the properties of vs from Lemmata 16.41 and 16.5 
it can be seen that (|3.ip can be replaced either by 



(3.2) sup¥{vs{X) > e} < e 

PeA 



or 



(3.3) supF[n{x : n{Be{x)) < 5}] < e. 

PeA 

(ii) If A = {Pi,P2, ...} then we can replace sup by limsup in ()3.ip . (j3.2p 
and D. □ 



The usage of Theorem [3] will be illustrated with the example of the A- 
coalescent measure tree constructed in the next section, and with examples 
of trees corresponding to spatially structured coalescents ( [GLWOT] ) and of 
evolving coalescents ( |GPW0 7j) in forthcoming work. 

Remark 3.3. Starting with Theorem [3] one characterizes easily tightness 
for the stronger topology given in |Stu06j based on certain L^-Wasserstein 
metrics if one requires in addition to (i) and (ii) uniform integrability of 
sampled mutual distance. 

Similarly, with Theorem [3] one characterizes tightness in the space of 
measure preserving isometry classes of metric spaces equipped with a fi- 
nite measure (rather than a probability measure) if one requires in addition 
tightness of the family of total masses (compare, also with Remark 17.2( 11)). 

□ 



4. Example: A-coalescent measure trees 

In this section we apply the theory of metric measure spaces to a class 
of genealogies which arise in population models. Often such genealogies are 
represented by coalescent processes and we focus on A-coalescents intro- 
duced in |Pit99] (see also Sag99| ). The family of A-coalescents appears in 



the description of the genealogies of population models with evolution based 
on resampling and branching. Such coalescent processes have since been the 
subject of many papers (see, for example, [MSOlj . [BG05], [BBC+n5] [LSOH] . 
[BBSnTj V 

In resampling models where the offspring variance of an individual during 
a reproduction event is finite, the Kingman coalescent appears as a special A- 
coalescent. The fact that general A-coalescents allow for multiple collisions is 
reflected in an infinite variance of the offspring distribution. Furthermore a 
A-coalescent is up to time change dual to the process of relative frequencies of 
families of a Galton- Watson process with possibly infinite variance offspring 
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distribution (compare [BBC"'"05] ). Our goal here is to decide for which A- 
coalescents the genealogies are described by a metric measure space. 

We start with a quick description of A-coalescents. Recall that a partition 
of a set S is a collection {^a} of pairwise disjoint subsets of S, also called 
blocks, such that S = Ua^a- Denote by Sqo the collection of partitions of 
N := {1,2,3,...}, and for all n G N, by the collection of partitions of 
{1, 2, 3, n}. Each partition V G Sqo defines an equivalence relation ~-p by 
i ~p j if and only if there exists a partition element tt £ V with i,j E tt. 
Write pn for the restriction map from Sqo to S^. We say that a sequence 
{Vk)keN converges in Sqo if for all n G N, the sequence {pn'Pk)keN converges 
in §n equipped with the discrete topology. 

We are looking for a strong Markov process ^ starting in Vo G Soo such 
that for all n G N, the restricted process ^„ := Pn°£, is an S„-valued Markov 
chain which starts in PnVo G S„, and given that (,n{t) has b blocks, each k- 
tuple of blocks of is merging to form a single block at rate Xb^k ■ Pitman 
[Pit99| showed that such a process exists and is unique (in law) if and only 
if 

(4.1) \b,k:= [\{dx)x^-\l-xf-^ 

Jo 

for some non-negative and finite measure A on the Borel subsets of [0, 1]. 

Let therefore A be a non- negative finite measure on ^([0, 1]) and V G Sqo- 
We denote by P^'^ the probability distribution governing ^ with ^(0) = V 
on the space of cadlag paths with the Skorohod topology. 

Example 4.1. If we choose 

(4.2) V' :={{!}, {2}, :■}, 

A = (5o, or A(dx) = dx, then P^''^'' is the Kingman and the Bolthausen- 
Sznitman coalescent, respectively. □ 

For each non-negative and finite measure A, all initial partitions V G Sqo 
and P^'^-almost all ^, there is a (random) metric on N defined by 

(4.3) r^{i,j) :=inf{t>0: ^-^^i}- 

That is, for a realization ^ of the A coalescent, r^[i,j) is the time it needs 
i and j to coalesce. Notice that is an ultra-metric on N, almost surely, 
i.e., for all i, j, k £ N, 

(4.4) r^{i,j) < r^{i, k) V r^k,j). 

Let (L^,r^) denote the completion of (N, r^). Clearly, the extension of 
to is also an ultra-metric. Recall that ultra-metric spaces are associated 
with tree-like structures. 

The main goal of this section is to introduce the A-coalescent measure 
tree as the metric space {L^,r^) equipped with the "uniform distribution". 
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Notice that since the Kingman coalescent is known to "come down immedi- 
ately to finitely many partition elements" the corresponding metric space is 
almost surely compact ( [ EvaOOj ) . Even though there is no abstract concept 
of the "uniform distribution" on compact spaces, the reader may find it not 
surprising that in particular examples one can easily make sense out of this 
notion by approximation. We will see, that for A-coalescents, under an ad- 
ditional assumption on A, one can extend the uniform distribution to locally 
compact metric spaces. Within this class falls, for example, the Bolthausen- 
Sznitman coalescent which is known to have infinitely many partition ele- 
ments for all times, and whose corresponding metric space is therefore not 
compact. 

Define Hn to be the map which takes a realization of the Soo-valued 
coalescent and maps it to (an isometry class of) a metric measure space as 
follows: 

(4.5) (L«,r«,/.|:=i^J^^5,). 

Put then for given Vq G Sqo) 

(4.6) Q'^'-:=(/?„)/^'^°. 

Next we give the characterization of existence and uniqueness of the A- 
coalescent measure tree. 

Theorem 4 (The A-coalescent measure tree). The family {Q"^'"; n G N} 
converges in the weak topology with respect to the Gromov-weak topology if 
and only if 

(4.7) [ A(dx)x~^ = oo. 

Jo 

Remark 4.2 ("Dust-free" property). Notice first that Condition (j4.7p is 
equivalent to the total coalescence rate of a given {i} G Vq being infinite 
(compare with the proof of Lemma 25 in |Pit99j ). 

By exchangeability and the de Finetti Theorem, the family {/(vr); vr G 
C(t)} of frequencies 

(4.8) /(vr) := lim -#{j G {1, n} : j G vr} 

n—+oo n 

exists for F^'^o almost ah vr G ^(t) and all t > 0. Define / := (/(vr); vr G 
(,{t)) to be the ranked rearrangements of {/(vr); vr G S,{t)} meaning that the 
entrees of the vector / are non-increasing. Let P"^'^" denote the probability 
distribution of /. Call the frequencies / proper if Yli>i fi'^i) — 1- 
Theorem 8 in |Pit99j . the A-coalescent has in the limit n ^ oo proper 
frequencies if and only if Condition ()4.7|) holds. 

According to Kingman's correspondence (see, for example. Theorem 14 
in [Pit99j), the distribution P^'^o ^nd P^'^o determine each other uniquely. 
For P G Soo and i G N, let := {j G N : i r^-p j} denote the partition 
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element in V which contains i. Then Condition ()4.7p holds if and only if for 
all t > 0, 

(4.9) pA-^o|/((^(i))i)=0}=0. 

The latter is often referred to as the "dust" -free property. □ 



Proof of Theorem For existence we will apply the characterization of tight- 
ness as given in Theorem [3l and verify the two conditions. 

(i) By definition, for all n G N, Q'^'"[wa'] is exponentially distributed with 
parameter A2,2- Hence the family {Q^'"'[wa']; n £ N} is tight. 

(ii) Fix t G (0, 1). Then for all 5 > 0, by the uniform distribution and 
exchangeability, 

Q^'"[^{x:5,(x) <5}] 
(4.10) = P^'^o [fii{x G : fii{Bt{x)) < 6\x = l}] 

= P^-^o{^|(S,(l))<5}. 

By the de Finetti theorem, /^^(^^(l)) ^^-^ /((^(t))i) , P^'^o-almost surely. 
Hence, dominated convergence yields 

lim lim Q^'"r^{x : B^ix) < 6}] = lim P^'^«|/((^(t))i) < 6] 
(4 11) (S^On^oo L ^ ^ 

= P^'^"{/((^(t))') = 0}. 

We have shown that Condition (j4.7p is equivalent to (j4.9p . and therefore, 
using ()3.3p . a limit of Q^'" exists if and only if the "dust-free" -property 
holds. 

Uniqueness of the limit points follows from the projective property, i.e. 
restricting the observation to a tagged subset of initial individuals is the 
same as starting in this restricted initial state. □ 

5. A COMPLETE METRIC: ThE GrOMOV-PROHOROV METRIC 

In this section we introduce the Gromov-Prohorov metric dcPr on M and 
prove that the metric space (M, dcPr) is complete and separable. In Section[9] 
we will see that the Gromov-Prohorov metric generates the Gromov-weak 
topology. 

Notice that the first naive approach to metrize the Gromov-weak topology 
could be to fix a countably dense subset {^n] n G N} in the algebra of all 
polynomials, and to put for X ,y G M, 

(5.1) dnaivc(<^,3^) ■.= Y,2-'^\^n{X)-^n{y)\- 

neN 

However, such a metric is not complete. Indeed one can check that the 
sequence n G N} given in Example I2.12( ii) is a Cauchy sequence w.r.t 
c^naive which docs not converge. 
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Recall that metrics on the space of probability measures on a fixed com- 
plete and separable metric space are well-studied (see, for example, [RacQlt 
IGS02j ). Some of them, like the Prohorov metric and the Wasserstein metric 
(on compact spaces) generate the weak topology. On the other hand the 
space of all (isometry classes of compact) metric spaces, not carrying a mea- 
sure, is complete and separable once equipped with the Gromov-Hausdorff 
metric (see, [EPW06] ). We recall the notion of the Prohorov and Gromov- 
Hausdorff metric below. 

Metrics on metric measure spaces should take both components into ac- 
count and compare the spaces and the measures simultaneously. This was, 
for example, done in |EW06j and |Stu06| . We will follow along similar lines 
as in jSt uOGj . but replace the Wasserstein metric with the Prohorov metric. 

Recall that the Prohorov metric between two probability measures and 
fj,2 on a common metric space {Z,rz) is defined by 

(5.2) 4t'^^Hw'/"2) :=inf {e > : /ii(F) < fi2iF')+e, VF closed} 
where 

(5.3) := {z e Z : rz{z,z') < e, for some z' G F}. 

Sometimes it is easier to work with the equivalent formulation based on 
couplings of the measures fii and H2, i.e., measures jl on X x Y with p,{- x 
Y) = /ii(') and fi{X x •) = /i2(*)- Notice that the product measure ni ® ^2 
is a coupling, and so the set of all couplings of two measures is not empty. 
By Theorem 3.1.2 in |EK86], 

^^'^^ = inf inf |e > : ii{{z,z) ^ Z x Z : rz{z,z) > e] < e}, 

where the infimum is taken over all couplings /i of ^1 and /i2- The met- 
ric (ip^'^^'* is complete and separable if {Z,rz) is complete and separable 
([EK861, Theorem 3.1.7). 

The Gromov-Hausdorff metric is a metric on the space Xc of (isome- 
try classes of) compact metric spaces. For {X,rx) and (y, ry) in Xc the 
Gromov-Hausdorff metric is given by 

(5.5) dGH((^,rx),(y,ry)) := inf di{^x{X),^Y{Y)) , 

where the infimum is taken over isometric embeddings ipx and from 
X and y, respectively, into some common metric space {Z,rz), and the 
Hausdorff metric d^'^^^ for closed subsets of a metric space {Z,rz) is given 
by 

(5.6) d\^'''^\x,Y) :=inf{e>0: X QY',Y CX'}, 
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where and are given by ([O]) (compare |(Tro99[ IBHMI IBBIOlj ). 

Sometimes, it is handy to use an equivalent formulation of the Gromov- 
Hausdorff metric based on correspondences. Recall that a relation R between 
two compact metric spaces {X,rx) and (F, ry) is any subset of X x y. A 
relation R C X x Y is called a correspondence iff for each x G X there 
exists at least one y G Y such that {x, y) S R, and for each y' £ Y there 
exists at least one x' X such that {x',y') G R. Define the distortion of a 
(non-empty) relation as 

(5.7) dis(i?) :=sup{|rx(x,x') -ry(y,y')l : {x,y), {x' ,y') G R} . 

Then by Theorem 7.3.25 in [BBIOlj . the Gromov-Hausdorff metric can be 
given in terms of a minimal distortion of all correspondences, i.e., 

(5.8) dGH{{X,rx),{Y,rY)) = ^ inf dis(i?), 

where the infimum is over all correspondences R between X and Y. 

To define a metric between two metric measure spaces X = {X,rx, fJ-x) 
and y = (y, ry,^y) in M, we can neither use the Prohorov metric nor the 
Gromov-Hausdorff metric directly. However, we can use the idea due to 
Gromov and embed {X, rx) and (Y, ry) isometrically into a common metric 
space and measure the distance of the image measures. 

Definition 5.1 (Gromov-Prohorov metric). The Gromov-Prohorov distance 
between two metric measure spaces X = {X,rx , fJ-x) and y = (y, ry,/iy) in 
M is defined by 

(5.9) dGFr[X,y):= inf df/^\{^x).llX,{^Y)*lJiY), 

where the infimum is taken over all isometric embeddings ipx and ipy from 
X and Y , respectively, into some common metric space {Z,rz)- 

Remark 5.2. (i) To see that the Gromov-Prohorov metric is well- 
defined we have to check that the right hand side of (|5.9|) does 
not depend on the element ofthe isometry class of {X,rx, fJ-x) and 
{Y,rY,HY)- We leave out the straight-forward details, 
(ii) Notice that w.l.o.g. the common metric space {Z,rz) and the iso- 
metric embeddings ipx and ipy from X and Y can be chosen to be 
X UY and the canonical embeddings ipx and cpy from X and Y 
to X U y, respectively (compare, for example, Remark 3.3(iii) in 
|Stu06] ). We can therefore also write 

(5.10) dGPri^^y) ■■= inf ' \iipx)*lJ'X, {^y)*IJ'y) , 

where the infimum is here taken over all complete and separable 
metrics r^'^ which extend the metrics rx on X and ry on Y to 
XUY. 
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Remark 5.3 (Gromov's Dj^-nietric) . Even though the material presented 
in this paper was developed independently of Gromov's work, some of the 
most important ideas are already contained in Chapter 3^ in |Gro99] . 

More detailed, one can also start with a Polish space {X, O) which is 
equipped with an probability measure ^ G Mi{X) on B{X), and then in- 
troduce a metric r : X x X — > as a measurable function satisfying the 
metric axioms. Polish measure spaces (X, /u) can be parameterized by the 
segment [0, 1) where the parametrization refers to a measure preserving map 
(p : [0, 1) X. If r is a metric on X then r can be pulled back to a metric 
(99^^)*r on [0, 1) by letting 

(5.11) {^'\r{t,t') ■.= r[^{t),^{t')). 

Notice that such a measure-preserving parametrization is far from unique 
and Gromov introduces his D^-distance between {X,r,fi) and {X',r',fj,') as 
the infimum of distances Di between the two metric spaces ([0, 1), {^p^^)^:r) 
and ([0, 1), {'>p~^)*r') defined as 

□i (d, d') 

(5.12) := sup {e > : 3Xe £ B{[0, 1)) : X{X,) < e, s.t. 

\d{ti,t2) -d'{h,t2)\ <e, \/h,t2eX\X,}, 

where the infimum is taken all possible measure preserving parameteriza- 
tions and A denotes the Lebesgue measure. 

The interchange of first embedding in a measure preserving way and then 
taking the distance between the pulled back metric spaces versus first embed- 
ding isometrically and then taking the distance between the pushed forward 
measures explains the similarities between Gromov's e-partition lemma (Sec- 
tion 3i.8 in [Gro99j l. his union lemma (Section 3^.12 in |Gro99| ) and his 
pre-compactness criterion (Section 3|.D in |Gro99j ) on the one hand and 
our Lemma 16.91 Lemma 15.81 and Proposition 17. 11 respectively, on the other. 

We strongly conjecture that the Gromov-weak topology agrees with the 
topology generated by Gromov's D^-metric but a (straightforward) proof is 
not obvious to us. □ 

We first show that the Gromov-Prohorov distance is indeed a metric. 

Lemma 5.4. dcPr defines a metric on M. 

In the following we refer to the topology generated by the Gromov- 
Prohorov metric as the Gromov-Prohorov topology. In Theorem [5] of Sec- 
tion [9] we will prove that the Gromov-Prohorov topology and the Gromov- 
weak topology coincide. 

Remark 5.5 (Extension of metrics via relations). The proof of the lemma 
and some of the following results is based on the extension of two metric 
spaces {Xi,rxi) and (X2, rx2) if a non-empty relation R C Xi XX2 is known. 
The result is a metric on Xi U X2 where U is the disjoint union. Recall the 
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distortion of a relation from (15. 7|] Define the metric space {Xi UX2, r^^y^^) 
by letting r^_^^j^^{x,x') := rXi{x,x') if x,x' € Xi, i = 1,2 and for xi G Xi 
and X2 € X2, 

(5 13) ^^1^X2(2^1 '2^2) 

:= inf {rxi(xi,Xi) + idis(i?) +rx2(a:^2,4) : G -R}- 

It is then easy to check that r^^^x^ defines a (pseudo-)metric on Xi U 
which extends the metrics on Xi and X2. In particular, 'f^^^j^^{xi,X2) = 
^dis(i?), for any pair (xi,X2) G i?, and 

(5.14) (f^'^^'''^^i^''2\7rii?,7r2-R) = idis(i?), 

where vri and -7r2 are the projection operators on Xi and X2, respectively. 
□ 

Proof of Lemma \5.4\ Symmetry is obvious and positive definiteness can be 
shown by standard arguments. To see the triangle inequality, let e,5 > 
and P^i := {Xi,rXi, fJ-xJ S M, z = 1,2,3, be such that c?GPr('^i; ^^2) < e 
and dGPr('^2 5 '^3) < Then, by the definition (|5.9p together with Re- 
mark [S^Kii) , we can find metrics r^'^ and r^'^ on Xi U X2 and X2 U X3, 
respectively, such that 

(5.15) 4^'^^''''' ■'((<^i)*/^Xi, (<^2)*^X2) < 
and 

(5.16) 4f^^^^''-''') (((^'2)^/^X2, ('^3)*^X3) < S, 

where (pi, (p2 and (p'2, (fs are canonical embeddings from Xi,X2 to Xi U X2 
and X2,X3 to X2 U X3, respectively. Setting Z := {Xi U X2) U {X2 U X3) 
we define the metric r^ on Z using the relation 

(5.17) R := {{^2{x),iP2{x)) : X G X2} C (Xi U X2) x (X2 U X3) 

and Remark 15.51 Denote the canonical embeddings from Xi, the two copies 
of X2 and X3 to Z by 921, (/?2; ¥'2 ^^^d ip^, respectively. Since dis(i?) = and 

(5.18) 4!''^^(('^2)*M2, (V'2)*/^2) = 0, 

by the triangle inequality of the Prohorov metric, 
(5.19) 

C^GPr('^l,'^3) < dp/''\{ipi)^fli, (V?3)*^3) 

< dp/''\{(pi)^fli, {(p2)^fl2) + dp/''\{(p2)*fi2, {'P'2)*fJ'2) 

<e + 5. 

Hence the triangle inequality follows by taking the infimum over all e and 5. 

□ 
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Proposition 5.6. The metric space is (M, dcPr) is complete and separable. 
We prepare the proof with a lemma. 

Lemma 5.7. Fix (e„)neN in (0,1). A sequence [Xn ■= (-^n, "Tn, ^n))neN in 
M satisfies 

(5.20) dGPr:{Xn,Xn+l) < £„ 

if and only if there exist a complete and separable metric space {Z, rz) and 
isometric embeddings ipi, ip2, ■■■ from Xi, X2, respectively, into {Z,rz), 
such that 

(5.21) d^p/^^ ((Vn)*Ain, {y^n+l)*fJ-n+l) < £■«• 

Proof. The "if" direction is clear. For the "only if" direction, take sequences 
(Xn := {Xn,rn, Hn))nen and (e„)„gN which satisfy (j5.20p . By Remark 15.21 
for Yn := Xn U Xn+i and all n E N, there is a metric ry^ on Yn such that 

(5.22) d^p^'''^"\{(pn)*fJ-n,{'Pn+l)*tJ-n+l) < £n 

where ipn and (pn+i are the canonical embeddings from Xn and Xn+i to Yn- 
Put 

(5.23) Rn := {{x,x') £ Xn x Xn+i : ry„(v3„(x), 99„+i(x')) < ^n}- 

Recall from (|5.4|) that (j5.22|) implies the existence of a coupling /i„ of ((/?n)*/U„ 
and (ipn+i)*fin+i such that 

(5.24) fln{{x,x') : rY„{y,y') < £„} > 1 - 
This implies that i?„ is not empty and 

(5.25) ^Pr"' ^" {{^n)*fJ'n,{^n+l)*fJ-n+l) < en- 

Using the metric spaces (l^,ry") we define recursively metrics rz„ on 
Zn ■= Ufc=i^A;- Starting with n = 1, we set {Zi,rzi) ■= {Xi,ri). Next, 
assume we are given a metric rz„ on Z„. Consider the isometric embeddings 
from Xfc to Zn, for k = 1, ...,n which arise from the canonical embedding 
of Xfc in Zn- Define for all n € N, 

(5.26) Rn := {iz,x) e Zn x Xn+i : ((V'")-'(^), x) G Rn} 

which defines metrics '"^"^^ on Zn+i via (j5.13p . 

By this procedure we obtain in the limit a separable metric space {Z' := 
1X^=1 Xn,rz'). Denote its completion by {Z,rz) and isometric embeddings 
from Xn to Z which arise by the canonical embedding hy ipn,n. G N. Observe 
that the restriction of rz to Xn U Xn+i is isometric to {Yn, i"y") and thus 

(5.27) d\^/^\{'tpn)*^J'X„, {i^n+l)*fJ'X„+i) < £n 

by (|5.25|) . So the claim follows. □ 
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Proof of Proposition \5.6[ To get separability, we partly follow the proof of 
Theorem 3.2.2 in |EK86j . Given X := {X,r,fi) G M and e > 0, we can 
find := (X, r, /i*") € M such that /i^ is a finitely supported atomic mea- 
sure on X and (ipr(^^,/i) < e. Now dcPrlX'^ , < e, while is just a 
"finite metric space" and can clearly be approximated arbitrary closely in 
the Gromov-Prohorov metric by finite metric spaces with rational mutual 
distances and weights. The set of isometry classes of finite metric spaces 
with rational edge-lengths is countable, and so (M, dcPr) is separable. 

To get completeness, it suffices to show that every Cauchy sequence 
has a convergent subsequence. Take therefore a Cauchy sequence 
('^n)neN in (Mjdcpr) and a subsequence (3^n)neN, 3^n = (^n,'^n,/"n) with 
dGPr(3^n; 3^n+i) < 2~"'. By Lemma [STTl we can choose a complete and sep- 
arable metric space {Z,rz) and, for each n € N, an isometric embedding 
ifn from Yn into {Z,rz) such that {{ipn)*lJ"n)n£N is a Cauchy sequence on 
M.i{Z) equipped with the weak topology. By the completeness of M.i[Z), 
{{<fn)*lJ'n)n(^N Converges to some fi G Mi{Z). 

Putting the arguments together yields that with Z := {Z,rz,fl), 

(5.28) dGPr(Jn,2) "=^0, 

SO that Z is the desired limit object, which finishes the proof. □ 

We conclude this section by another Lemma. 

Lemma 5.8. Let X = {X,r,fi), Xi = {Xi,ri,fii), X^ = {X2,r2, H2), ■■■ be 
in M. Then, 

(5.29) dGPr{Xn,X) ^^=^0 

if and only if there exists a complete and separable metric space {Z,rz) and 
isometric embeddings ip,ipi,ip2, ... from X,Xi,X2 into {Z,rz), respectively, 
such that 

(5.30) d^^/^\{ipn)*lJ-n,'P*lj) "^°°> 0. 

Proof. Again the "if" direction is clear by definition. For the "only if" 
direction, assume that (j5.29p holds. To conclude (15.30p we can follow the 
same line of argument as in the proof of Lemma 15.71 but with a metric r 
extending the metrics r, ri, r2,... built on correspondences between X and 
Xn (rather than Xn and X„_|_i). We leave out the details. □ 

6. Distance distribution and Modulus of mass distribution 

In this section we provide results on the distance distribution and on the 
modulus of mass distribution. These will be heavily used in the following 
sections, where we present metrics which are equivalent to the Gromov- 
Prohorov metric and which are very helpful in proving the characterizations 
of compactness and tightness in the Gromov-Prohorov topology. 
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We start by introducing the random distance distribution of a given metric 
measure space. 

Definition 6.1 (Random distance distribution). Let X = {X,r,fi) € M. 
For each x G X, define the map r^ X [0, oo) by rx{x') := r{x,x'), and 
put fi^ := {rx)*fi G A^i([0,oo)), i.e., fi^ defines the distribution of distances 
to the point x G X. Moreover, define the map r : X — A1i([0, oo)) by 
f{x) := fj,^, and let 

(6.1) fix ■■= r./i G Mi{Mi{[0, oo))) 

be the random distance distribution of X. 

Notice first tliat the random distance distribution does not characterizes 
the metric measure space uniquely. We wih ihustrate this with an example. 

Example 6.2. Consider the following two metric measure spaces: 




That is, both spaces consist of 8 points. The distance between two points 
equals the minimal number of edges one has to cross to come from one point 
to the other. The measures fix and /iy are given by numbers in the figure. 
We find that 

fi-x = fi-y= iTT'^j..,^ , 9_,^ , 1,^ + y±,,^ , 2^, , 1, 

(6.2) 



10 20 "^o +20 ^2+ 2 ''a 5 _5(,_|__52 + _53 
+ ^'^3. , 7. ,1. +|^lr, 3. ,U- 



Hence, the random distance distributions agree. But obviously, X and y 
are not measure preserving isometric. □ 

Recall the distance distribution w. and the modulus of mass distribution 
vs{-) from Definition 12.91 Both can be expressed through the random dis- 
tance distribution fi{-). These facts follow directly from the definitions, so 
we omit the proof. 

Lemma 6.3 (Reformulation of w. and vs{-) in terms of /*(•))• -^ei X € M. 
(i) The distance distribution wx satisfies 



(6.3) wx = I ftx{du) V. 

JA^i([0,oo)) 



20 ANDREAS GREVEN, PETER PFAFFELHUBER, AND ANITA WINTER 

(ii) For all 6 > 0, the modulus of mass distribution vs{X) satisfies 

(6.4) vsiX) = inf {e > : ilx{i^ G Xi([0,oo)) : z^([0,e)) < 5} < e}. 

The next result will be used frequently. 

Lemma 6.4. Let X = {X,r,fi) G M and 6 > 0. If vs{X) < e, for some 
e > 0, then 

(6.5) ^{x G X : ^i{Be{x)) <5] <£. 

Proof. By definition of vs{-), there exists e' < e for which ;u{x G X : 
IJ,{B^i{x)) < 5} < e' . Consequently, since {x : n{B^{x)) < 5} ^ {x : 
^^{BAx)) < S}, 

(6.6) : niBeix)) < 5} < fi{x : ^i{B^i{x)) < 5} < < e, 

and we are done. □ 

The next result states basic properties of the map 5 ^ vs- 

Lemma 6.5 (Properties of vs{-))- Fix X G M. The map which sends 5 > 
to v^[X) is non- decreasing, right- continuous and hounded by 1. Moreover, 

vsiX) '-=5 0. 

Proof. The first three properties are trivial. For the forth, fix e > 0, and 
let X = {X, r, fi) G M. Since X is complete and separable there exists a 
compact set C X with iJi,{Kf,) > 1 — e (see |EK86j . Lemma 3.2.1). In 
particular, can be covered by finitely many balls Ai^ Ai\f^ of radius 
e/2 and positive /i-mass. Choose 5 such that 

(6.7) < (5 < min{/i(Ai) : 1 < i < A^e}. 



Then 



^l{x e X : ^{B,{x)) > 5] > ^l{[J!^^^Ai) 

(6-8) > ti{K^) 

>l-e. 

Therefore, by definition, vs{X) < e, and since e was chosen arbitrary, the 
assertion follows. □ 

The following proposition states continuity properties of fi{-), w. and vs{-). 
The reader should have in mind that we finally prove with Theorem [5] in 
Section [9] that the Gromov-weak and the Gromov-Prohorov topology are the 
same. 

Proposition 6.6 (Continuity properties of /i('), w. and vs{-)). 

(i) The map X fix is continuous with respect to the Gromov-weak 
topology on M and the weak topology on Ali(A1i([0, oo))). 
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(ii) The map X i— > jix is continuous with respect to the Gromov-Prohorov 
topology on M and the weak topology on A^i(A1i([0, oo))). 

(iii) The map X i— > wx is continuous with respect to both the Gromov- 
weak and the Gromov-Prohorov topology on M and the weak topology 
on Mi{[0, oo)). 

(iv) Let X, Xi, X2, ... in M such that fix„ '^^^ fi^ and 6 > 0. Then 

(6.9) limsupt>5(<Y„) < vsiX). 

n^oo 

The proof of Parts (i) and (ii) of Proposition 16.61 are based on the notion 
of moment measures. 

Definition 6.7 (Moment measures of fix)- For X = {X,r,fi) E M and 
k gN, define the k^^ moment measure fi^ G A4i([0, oo)'^') of fix by 

(6.10) /i^(d(ri,...,rfc)) := / fix{di^) i^^H^in, ...,rk)). 



Remark 6.8 (Moment measures determine fix)- Observe that for aU A; G N, 



/i^(^i X ... X Ak) 

= |U®''+^{(uo,iii, ...,Mfc) : r{uo,ui) G Ai, ...,r{uo,Uk) G Ak}. 

By Theorem 16.16 of |Kal02j . the moment measures fi^,k = 1,2,... de- 
termine fix uniquely. Moreover, weak convergence of random measures is 
equivalent to convergence of all moment measures. □ 

Proof of Proposition [FTSl (i) Take X, Xi, X2, ... in M such that 
(6.12) $(Af„) ^X), 

for all <^ G n. For k ^ N, consider all (j) G Cb([0, 00)'^ 2 J) which depend 
on (rjj)o<i<i<fc only through (ro,i, ro,fc), i.e., there exists ((> G Cb([0, cx))'') 
with </'((ry)o<i<i<fc) = <^((^o,i)i<i<fc)- Since for any y = (y,r,/i) G M, 

Ay(d(n, ...,rfc))<^(ri, ...,rfc) 



(6.13) = / iJ.'^'''^^{d{uo,ui, ...,Uk)) (t){r{uo,ui), ...,r{uo,Uk)) 

;U®''+^(d(no,iii, ...,nfc)) (t>{{r{ui,Uj))o<i<j<k) 



it follows from ()6.12|) that /i^^ 'i^^ ^fe^ -^^ topology of weak convergence. 
Since k was arbitrary the convergence fix„ '^^^ fip^; follows by Remark 16.81 

(ii) Once more it suffices to prove that all moment measures converge. 

Let X = (X,rx, fJ'x) £ M and e > be given. Now consider a metric 
measure space y = (y, ry,//y) G M with dGPriX,y) < e. 
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We know that there exists a metric space {Z,rz), isometric embeddings 
(fx and ifY of supp(/xx) and supp(^y) into Z, respectively, and a couphng 
fl of {ipx)*fJ'X and (v3y)*/iy such that 

(6.14) fi{{z,z') ■.rziz,z')>e} <e. 

Given A; E N, define a couphng //'^ of /i^ and fiy by 
(6.15) 

/i'=(^l X ••• X ^fc X Si-- - X Sfc) 

:= A®^''"^^H(^0,4)' (^fc,4) : rz{zo,Zi) G ^i,rz(4,^i) ^ Bi,i= 1, 
for aU X - - - X Afc X 5i X • • • X Bfc G ^(M^fc). Then 

/i'^l (ri, rfc, r'l, r^) : |rj — > 2e for at least one 

< /c - /u^{(ri, r'l) : |ri — r'll > 2e} 

(6.16) =k-fc'^mz,z'),{zrz'): \rz{z, z) - rz{z' , z')\ > 2e} 
<k- f,'^mz,z'),{2,S') : rz{z,z')>e or rz{z,z')>e} 

< 2ke, 

which implies that {fi'^ , fiy) < 2ke, and the claim follows. 

(iii) By Part (i) of Lemma 16.31 for X G M, wx equals the first moment 
measure of fix- The continuity properties of A" i-^- wx are therefore a direct 
consequence of (i) and (ii). 

(iv) Let X, Xi, X2, ... in M such that (lx„ jlp^ and 5 > 0. Assume 
that e > is such that e > vs{X). Then by Lemmata 16.3^ 11) and 16.41 

(6.17) fLx{iy G 7Wi([0,oo)) : z.([0,e)) < <5} < e. 

The set {u G A1i([0,oo)) : J^([0,e)) < 6} is closed in Mi{[0,oo)). Hence 
by the Portmanteau Theorem (see, for example, Theorem 3.3.1 in |EK86| ). 

limsup/iAf„{i^ G 7Wi([0,oo)) : ui[0,e)) < 5} 

(6.18) 

< fixii^ G A^i([0,oo)) : u{[0,e)) < 6} < e. 

That is, we have vs{Xn) < e, for all but finitely many n, by ()6.9p . There- 
fore we find that hmsupn_^vs{Xn) < e. This holds for every e > vs{X), 
and we are done. □ 

The following estimate will be used in the proofs of the pre-compactness 
characterization given in Proposition 17.11 and of Part (i) of Lemma 110.31 

Lemma 6.9. Let 5 > 0, e > 0, and X = {X,r,fi) G M. If vs{X) < e, then 
there exists N < [|J and points xi, ...,xn G X such that the following hold. 

N 

• Fori = 1, ...,N, fi{Be{xi)) > 6, and n{ \J B2e{xi)) > 1 - e. 

i=l 

• For all i,j = 1, N with i ^ j, r[xi,Xj) > e. 
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Proof. Consider the set D := {x £ X : fj,{B^{x)) > 6}. Since vsi-V) < £, 
Lemma 16.41 implies that //(D) > 1 — e. Take a maximal 2e separated net 
{xi : i £ 1} C D, i.e., 

(6.19) DQ]jB2eix^), 
and for all i ^ j, 

(6.20) r{xi,Xj)>2e, 

while adding a further point to D would destroy (j6.20p . Such a net exists 
in every metric space (see, for example, in |BBI01 j. p. 278). Since 

(6.21) 1 > /^( U Bsi^i)) = Yl y^{Be{xi)) > m 

\I\ < [ij follows. □ 



7. Compact sets 

By Prohorov's Theorem, in a complete and separable metric space, a set 
of probability measures is relatively compact iff it is tight. This implies that 
compact sets in M play a special role for convergence results. In this section 
we characterize the (pre-)compact sets in the Gromov-Prohorov topology. 

Recall the distance measure wx from (j2.7p and the modulus of mass dis- 
tribution vs{X) from (j2.8p . Denote by (Xc^gh) the space of all isometry 
classes of compact metric spaces equipped with the Gromov-Hausdorff met- 
ric (see Section [5] for basic definitions). 

The following characterizations together with Theorem [5] stated in Sec- 
tion[9]which states the equivalence of the Gromov-Prohorov and the Gromov- 
weak topology imply the result stated in Theorem [2l 

Proposition 7.1 (Pre-compactness characterization). Let T be a family in 
M. The following four conditions are equivalent. 

(a) The family T is pre-compact in the Gromov-Prohorov topology. 

(b) The family \^w{X); G P} is tight, and 

(7.1) supvsiX) ^^0. 

(c) For all e > there exists A'e € N such that for all X = (X, r, /u) S P 
there is a subset Xf^^x ^ X with 

— l^{Xe,x) >l-e, 

— X^^x can be covered by at most balls of radius £, and 

— X^^x has diameter at most Nf,. 

(d) For all £ > and X = (X, r, ^) € P there exists a compact subset 
Ke,x ^ X with 

— ^[Ki;^x) > 1 — £; o.'iT'd 

— the family fCe := {Kg^ x', X £T} is pre-compact in (Xcdcn)- 
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Remark 7.2. 

(i) In the space of compact metric spaces equipped with a probabiUty 
measure with full support, Proposition 2.4 in |EW06| states that 
Condition (d) is sufficient for pre-compactness. 

(ii) Proposition 17.1( b) characterizes tightness for the stronger topology 
given in |Stu06j based on certain L^-Wasserstein metrics if one re- 
quires in addition uniform integrability of sampled mutual distance. 

Similarly, (b) characterizes tightness in the space of measure pre- 
serving isometry classes of metric spaces equipped with a finite mea- 
sure (rather than a probability measure) if one requires in addition 
tightness of the family of total masses. □ 



Proof of Proposition \ 7. 1\ As before, we abbreviate X = {X,rx , fJ-x)- We 
prove four implications giving the statement. 

(a) =^ (6). Assume that F E M is pre-compact in the Gromov-Prohorov 
topology. 

To show that {w{X); Af € P} is tight, consider a sequence Xi,X2,--- 
in P. Since P is relatively compact by assumption, there is a converging 

subsequence, i.e., we find X £ Wl such that dcpri^nk, *— ^ along a 
suitable subsequence {nk)keN- By Part (iii) of Proposition WM wx„^ 
wx- As the sequence was chosen arbitrary it follows that |w(<Y); Af E P} is 
tight. 

The second part of the assertion in (b) is by contradiction. Assume that 
vs{X) does not converge to uniformly in A" E P, as (5 ^ 0. Then we find 
an e > such that for all n E N there exist sequences (5n)neN converging to 
and E P with 

(7.2) V5„m>e. 

By assumption, there is a subsequence {AW^.; k E N}, and a metric mea- 
sure space A" E P such that dcPr ( A'nfc , A") '^-^^ 0. By Parts (ii) and (iv) of 
Proposition 16.61 we find that lim sup^^oo ■u^^ (-^rn.) = which contradicts 

(6) =^ (c). By assumption, for all e > there are C{e) with 

(7.3) su-pwx{[C{e),oo))<e, 

and 6{e) such that 

(7.4) supvs(^^){A:) <e. 
Set 



(7.5) 



X^;, :={xEX: ^Jix{B 2 ix)) > I - e/2] . 
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We claim that iix{X'^ ^) > 1 — e/2. If this were not the case, there would 
be ;f G r with 

> fif{ix,x') : X i K^x.x' i S^^ei/^)} 

which contradicts (|7.3p . Furthermore, the diameter of X'^ is bounded 

by 4C(^). Indeed, otherwise we would find points x,x' G X'^ x with 
B ^2 (x) r\B -2. (x'\ = 0, which contradicts that 

(7.7) 

^^x {B ,2 (x) n 2 (xO) > 1 - MX (Ci3 ,2 (x)) - ^lx (Ci? ,2 (x')) 

> 1 - e. 



By Lemma 16.91 for ^ = {X^rx^^J-x) G T) we can choose points 
xi,...,x^rA- G X with iVf < A^(e) := Vji^\, rx{xi,Xj) > e/2, 1 < i < 

j < N^, and with /ix(US B,{xi)) > 1 - e/2. 
Set 

(7.8) :=X;;,nUi?,(x,). 

Then /ux(^e,A') > 1 — £• In addition, Xf^^x can be covered by at most 
A^(e) balls of radius e and X'^ has diameter at most 4C(^), so the same 
is true for X^^x- 

(c) =^ (d). Fix e > 0, and set e„ := e2~("+^\ for all n G N. By assumption 
we may choose for each n G N, N^^ G N such that for all G F there is a 
subset Xi^^^^x ^ ^ of diameter at most A'g,^ with ^i^X^^^x) > 1 — En, and 
such that -'^e„,A' can be covered by at most A^^^ balls of radius e„. Without 
loss of generality we may assume that all {Xg^^^f ; n G N, <Y G F} are closed. 
Otherwise we just take their closure. For every G F take compact sets 
Ks^^x ^ X with iix{Ks,,,x) > 1 - Then the set 

oo 

(7.9) K,,x:= n {Xe^,xnK,^,x) 

n=l 

is compact since it is the intersection of a compact set with closed sets, and 



oo 

(7.10) f^xiKe,x) > 1 - E (/"x(CX,„,;f ) + /Xx(CKe„,A')) > 1 " £■ 

n=l 
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Consider 

(7.11) lC,:={K,x,X^^]- 

To show that /C^ is pre-compact we use the pre-compactness criterion 
given in Theorem 7.4.15 in [BBIOlj . i.e., we have to show that /C^ is uniformly 
totally bounded. This means that the elements of K.^ have bounded diameter 
and for all e' > there is a number A^^/ such that all elements of /C^ can be 
covered by N^i balls of radius e' . By definition, K-£,x ^ Xg^^^x and so, K-e,x 
has diameter at most N^-^. So, take e' < e and n large enough for En < £' ■ 
Then Xi^^^x as well as K^^x can be covered by A''^^ balls of radius e' . So /C^ 
is pre-compact in (Xcdcn)- 

{d) =?■ (a). The proof is in two steps. Assume first that all metric spaces 
(X, rx) with (X, rx,/Ux) £ T are compact, and that the family {{X,rx) '■ 
{X,rx, fJ'x) G r} is pre-compact in the Gromov-Hausdorff topology. 

Under these assumptions we can choose for every sequence in F a subse- 
quence (Afm)meN, = iXm,rXm,fJ'X,J, and a metric space {X,rx), such 
that 

(7.12) dGH(X,Xm)"^0. 

By Lemma IA.1|, there are a compact metric space {Z,rz) and isomet- 
ric embeddings fx, fXn fX2i ■■■ from X, Xi, X2, respectively, to 
Z, such that di{[fxiX),(pXr„iXm)) 0. Since Z is compact, the set 

{ifXm)*l^x„i ■ m £ N} is pre-compact in M.i{Z) equipped with the weak 
topology. Therefore {(pXm)*^^Xm has a converging subsequence, and (a) fol- 
lows in this case. 

In the second step we consider the general case. Let := 2~", fix for 
every ^ G F and every n G N, a; € Kg^^^x- Put 

(7.13) fix,n{-) ■■= f^xi- n K,^,x) + (1 - ^^x{Ke^,x))SA■) 
and let A'" := {X,rx, l-i'X,n)- By construction, for all ^ G F, 

(7.14) (iGPr(-^",'^) <£«, 

and fj.x,n is supported by Kgr^^x- Hence, F„ := {Af"; Af G F} is pre-compact 
in Xc equipped with the Gromov-Hausdorff topology, for all n G N. We can 
therefore find a converging subsequence in F„, for all n, by the first step. 

By a diagonal argument we find a subsequence (<^m,)meN with Xm = 
{Xm^rxmi t^Xm) such that (^^)meN converges for every n G N to some 
metric measure space Z^- Pick a subsequence such that for all n G N and 
m > n, 

(7.15) dG^r{x:^,Zn)<em. 
Then 

(7.16) dGPr(^^,-^;^0 <2en, 
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for all m, m! > n. We conclude that {Xn)n£N is a Cauchy sequence in 
(M, (icPr) since X^„>ien < oo. Indeed, 

(7.17) < dGPr('^n, ^n) + '^GPr('^^, ^n+l) + C^GPr('^n+l) ^n+l) 

Since (M, dcPr) is complete, this sequence converges and we are done. □ 



8. Tightness 

In Proposition 17. II we have given a characterization for relative compact- 
ness in M with respect to the Gromov-Prohorov topology. This characteri- 
zation extends to the following tightness characterization in A^i(M) which is 
equivalent to Theorem[3l once we have shown the equivalence of the Gromov- 
Prohorov and the Gromov-weak topology in Theorem [5] in Section [TOl 

Proposition 8.1 (Tightness with respect to the Gromov-Prohorov topol- 
ogy). A set A C A4i{M.) is tight with respect to the Gromov-Prohorov topol- 
ogy on M if and only if for all e > there exist 6 > and C > such that 

(8.1) suvF[vs{X)+wx{[C;oo))] <e. 

PeA 



Proof of Proposition \8.1[ For the "only if" direction assume that A is tight 
and fix e > 0. By definition, we find a compact set in (M, dcPr) such 
that infpgA IP'(r£) > 1 — e/4. Since is compact there are, by part (b) of 
Proposition [7711 6 = 6{e) > and C = C{e) > such that vs{X) < e/4 and 
wxi[C,oo)) < e/4, for all X G T^. Furthermore both vs{-) and w.{[C,oo)) 
are bounded above by 1. Hence for all P E A, 

¥[vs{X)+w;^{[C,cx^))] 

= r[vs{?i:) + wx{[C, oo)); r,] +F[vs{X) + wx{[C, cx))); Cr,] 



e e 
<2 + 2 



= e. 

Therefore (HI]) holds. 

For the "if" direction assume (18.11) is true and fix e > 0. For all n G 



there are (5„ > and C„ > such that 

^ ■ ^ PeA 

By Tschebychev's inequality, we conclude that for all n E N, 

(8.4) supP{^ : vsM)+ wx{[Cn, oo)) > 2-"e} < 2-"e. 

PeA 
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By the equivalence of (a) and (b) in Proposition 17.11 the closure of 

oo 

(8.5) r, := n {-^ : ^sS^) + wx{[Cn, oo)) < 

n=l 

is compact. We conclude 

p(r;) > p(r,) 

oo 

(8.6) >1-^P{^: vs„{X) + wx{[Cn,^))> 

n=l 

> 1 - e. 

Since e was arbitrary, A is tight. □ 

9. Gromov-Prohorov AND Gromov-weak topology coincide 

In this section we show that the topologies induced by convergence of 
polynomials and convergence in the Gromov-Prohorov metric coincide. This 
implies that the characterizations of compact subsets of M and tight families 
in Ali(M) in Gromov-weak topology stated in Theorems [2] and [3] are covered 
by the corresponding characterizations with respect to the Gromov-Prohorov 
topology given in Propositions 17. l l and 18. 1| respectively. Recall the distance 
matrix distribution from Definition 12.71 

Theorem 5. Let X,Xi,X2, ... G M. The following are equivalent: 

(a) The Gromov-Prohorov metric converges, i.e., 

(9.1) dGPr(A'„,A') ^^0. 

(b) Distance matrix distributions converge, i.e. 

(9.2) i/'^" ^v-^ as n ^ oo. 

(c) All polynomials $ G 11 converge, i.e., 

(9.3) ^{Xn) ^X). 

Proof, (a) =^ (6). Let X = {X,rx,lJ-x), Xi = {Xi,ri,i2i), X2 = (^2, '^2, /"2)- 
By Lemma 15.81 there are a complete and separable metric space {Z,rz) 
and isometric embeddings ip, (pi, (p2,.-- from {X,rx), {Xi,ri), {X2,r2), 
respectively, to {Z,rz) such that {(pn)*lJ'n converges weakly to on 
{Z,rz). Consequently, using (j2.4p . 
(9.4) 

(6) =^ (c). This is a consequence of the two different representation of 
polynomials from (j2.2p and (j2.6p . 

(c) (a). Assume that for all $ e B, $(A'„) ^^-^ ^{X). It is enough to 
show that the sequence {Xn)n<^N is pre-compact with respect to the Gromov- 
Prohorov topology, since by Proposition 12.61 this would imply that all limit 
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points coincide and equal X. We need to check the two conditions guaran- 
teeing pre-compactness given by Part (b) of Proposition 17.11 

By Part (iii) of Proposition 16.61 the map X i— > wx is continuous with 
respect to the Gromov-weak topology. Hence, the family {wx„'-, n G N} is 
tight. 

In addition, by Parts (i) and (iv) of Proposition [6^ limsup„_^oo vs[Xn) < 

vs{X) 0. By Remark 12.111 the latter implies ()7.ip . and we are done. 

□ 



10. Equivalent metrics 

In Section [5] we have seen that M equipped with the Gromov-Prohorov 
metric is separable and complete. In this section we conclude the paper by 
presenting further metrics (not necessarily complete) which are all equivalent 
to the Gromov-Prohorov metric and which may be in some situations easier 
to work with. 

The Eurandom metric0 Recall from Definition 12.31 the algebra of polyno- 
mials, i.e., functions which evaluate distances of finitely many points sam- 
pled from a metric measure space. By Proposition [2]6l polynomials separate 
points in M. Consequently, two metric measure spaces are different if and 
only if the distributions of sampled finite subspaces are different. 

We therefore define 
(10.1) 

d^nr{X,y) :=inf inf{e>0: fL'^^{{x,y),{x\y') {X xVf : 

\rx{x,x') - ry(y,y')l > e} < «^}, 

where the infimum is over all couplings /i of fix and /xy. We will refer to 
rfEur as the Eurandom metric. 

Not only is dsur a metric on M, it also generates the Gromov-Prohorov 
topology. 

Proposition 10.1 (Equivalent metrics). The distance d-Eur is a metric on 
M. It is equivalent to dcPr; *-e., the generated topology is the Gromov-weak 
topology. 

Before we prove the proposition we give an example to show that the 
Eurandom metric is not complete. 



When we first discussed how to metrize the Gromov-weak topology the Eurandom 
metric came up. Since the discussion took place during a meeting at Eurandom, we 
decided to name the metric accordingly. 
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Example 10.2 (Eurandom metric is not complete). Let for all n € N, 

Xn := {Xn,rn, fJ-n) as in Example I2.12( ii). For all n € N, 

(10.2) 

< inf {e > : fin+i)^H\H^ = x'} - l{y = y'}\ > e} < e} 

i.e., (Af„)„gN is a Cauchy sequence for dsur which does not converge. Hence 
(M, c^Eur) is not complete. The Gromov-Prohorov metric was shown to be 
complete, and hence the above sequence is not Cauchy in this metric. Indeed, 

n^oo 

(10.3) dGPr{Xn,Xn+l) = 2'^ ^ 0. 

□ 

To prepare the proof of Proposition 110. H we provide bounds on the in- 
troduced "distances" . 

Lemma 10.3 (Equivalence). Let X,y ^M, and 5 G (0, \). 

(i) IfdEnr{X,y) < 5^ then dGPr{X,y) < 12{2vsiX) + 5) . 
(ii) 

(10.4) dEur{X,y) <2dGPr{X,y). 

Proof, (i) The Gromov-Prohorov metric relies on the Prohorov metric of 
embeddings of fix and /iy in A4i(Z) in a metric space {Z,rz)- This is in 
contrast to the Eurandom metric which is based on an optimal coupling of 
the two measures fix and fiy without referring to a space of measures over 
a third metric space. Since we want to bound the Gromov-Prohorov metric 
in terms of the Eurandom metric the main goal of the proof is to construct 
a suitable metric space (Z, r^). 

The construction proceeds in three steps. We start in Step 1 with finding 
a suitable e-net {xi, ...,xn} in (X, rx), and show that this net has a suitable 
corresponding net {yi, y^} in (Y, ry). In Step 2 we then verify that these 
nets have the property that rx{xi,Xj) ryiyi^yj) (where the 'r^' is made 
precise below) and (5-balls around these nets carry almost all fix- and /Uy- 
mass. Finally, in Step 3 we will use these nets to define a metric space (Z, rz) 
containing both {X,rx) and (y, ry), and bound the Prohorov metric of the 
images of fix and fiy- 

Step 1 (Construction of suitable e -nets in X andY). Fix 5 € (0,^). Assume 
that X ,y ^ M. are such that dEar{X ,y^ < d'^. By definition, we find a 
coupling fi of fix and fiy such that 

(10.5) /i®'{(xi,yi),(x2,y2) : \rx{xi,X2) - ry{yi,y2)\> 25} < 5\ 
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Set e := Avs{X) > 0. By Lemma lOl there are < [^J points xi, ...,xn E 
X with pairwise distances at least e, 

(10.6) f^{Be{xi)) > 6, 
for all i = 1, N, and 

(10.7) fi{\J,^^B,{xi))>l-e. 

PutD:= U-=i Bs{xi). We claim that for every i = 1, A'" there is S y 
with 

(10.8) fl{Bsixi) X B2^e+5){yi)) > (1 - 6^)flx{BeiXi)). 

Indeed, assume the assertion is not true for some 1 < i < A^. Then, for 
all y G y, 

(10.9) fi{Be{xi) X Ci32(,+5)(y)) > 6^^J.x{Be{Xi)). 

which implies that 

fl'^'^{{x',y'),{x",y") : \rx{x',x") -ry(y',y")| > 26} 

> A^n(x',2/'), ix",y") : x',x" G B,{x,),y" ^ i?2(.+5)(y')} 

> ^^x{B,{x.^f6^ 
>6\ 

by (I619D and (fTOQ]) which contradicts (fTTTHD . 

S'iep S (Distortion of{xi, ...,xn} and {yi, y„} Assume that {xi, ...,xiy} 
and {2/1, y„} are such that (I10.6P through (|10.8p hold. We claim that then 

(10.11) \rx{xi,Xj) - rY{yi,yj)\ <Q{e + 6), 

for all i,j = l,...,N. Assume that (110. lip is not true for some pair {i,j)- 
Then for all x' G Bs{xi), x" G Bs{xj), y' G 52{£+5)(yj), and y" G B2(s+5){yj), 

(10.12) \rx{x',x")-rYiy',y")\ > 6(8 + 6) - 2e - i{e + 6) = 26. 

Then 
(10.13) 

fl'^'^{{x',y'),{x" ,y") : \rx{x',x") -rY{y',y")\ > 26} 
>fi^mx',y'),{x",y"): 

x' G Be{xi),x" G Be{xj),y' G 52(^+5) (^i), / e 52(e+5)(yi)} 
= jl{B^{Xi) X B2(^e+5){yi))iJ'{Be{Xj) X 52(£+5) (l/i)) 

><52(l-<5)2 

where we used ([09]) and 5 < i. Since (|10.13p contradicts p^3]) . we 

are done. 
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Step 3 (Definition of a suitable metric space {Z,rz))- Define the relation 
R := {{xi,yi) : i = 1, N} between X and Y and consider the metric space 
{Z,rz) defined hy Z := X UY and rz ■= ''xuy given as in Remark 15.51 
Choose isometric embeddings (fx and (pY from (X,rx) and (y, ry), re- 
spectively, into {Z,rz)- As dis(i?) < 6(e + 5) (see (|5.7p for definition), by 
Remark 15.51 rz{^x{xi), ipY{y-i)) < 3(e + (5), for all i = 1, N . 

If X G X and y ^ Y are such that rz{'Px{x)-,^Y{y)) > 6(e + 5) and 
rx{x, Xi) < £ then 

rY{y,yi) > rz{ipx{x),(pY{y)) - rx{x,Xi) - rz{^x{xi),^Y{yi)) 

(10.14) > 6(e + 5) -e-3(e + <5) 
> 2(e + 5) 

and so for all x € B^{xi), 

(10.15) {yey: rz((/^x(x),(^y(y)) >6(e + (^)} cCi32(e+5)(yi)- 

Let /i be the probability measure on Z x Z defined by jl{A x B) := 
^(^^(^) X lPy^{B)), for all yl, B G B{Z). Therefore, by (fUnTD . (fTTmi) . 
(fTilH]) and as < [1/5J, 
(10.16) 

fi{{z,z'):rz{z,z')>Q{£ + 5)} 

N 

< /i(^x(CD) X (pYiY)) + a( U ^e(^x(a;i)) X CB2(e+5)((/7y(yi))) 

i=l 

N 

<£ + Y,^'x{Be{x^))6^ 

i=l 
<£ + 6. 

Hence, using (15. 4|) and e = 4vs{X), 

(10.17) 4?'''^ ((V'x)*/xx, {^Y)*fiY) < 6{4vs{X) + 2<5) , 

and so dcpj^^X ,y) < 12(2vs{X) + 6), as claimed. 

(ii) Assume that dcp^i^X ,y) < 6. Then, by definition, there exists a met- 
ric space {Z,rz), isometric embeddings fx and ipY between supp(;Ux) and 
supp(/xy) and Z, respectively, and a coupling fi of {fx)*fJ'X and (v3y)*/iy 
such that 

(10.18) il{{z, z') : rz{z, z') >8}<8. 

Hence with the special choice of a coupling /i of iix and /xy defined by 
Il{A xB) = ^{<fx{A) X <PYiB)), for all A G S(A) and B G i3(y), 

eXxy : |rx(x,x')-ry(y,y')l >2<5} 

?'z(9'x(a;),(/?y(y)) > 5 or rz(v'x(a::')) '/'y (^0) > 
< 2(5. 
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This implies that iiEur(<^,3^) < 2(5. 
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Proof of Proposition ] 10.1[ Observe that by Lemma 16.51 vs{X) — — > 0. So 
Lemma 110.31 imphes the equivalence of dcpr and c^Eur once we have shown 
that dEur is indeed a metric. 

The symmetry is clear. If X, y ^M. are such that dEuv{^ ,y) = 0, by 
equivalence, dcp^iX ,y) = and hence X = y. 

For the triangle inequality, let Xi = {Xi,ri,fj,i) G M.,i = 1,2,3, be such 
that (iEur('^i> -^2) < £ and (iEur('^2> -^3) < S for some e,6 > 0. Then there 
exist couplings /li^2 of fii and fi2 and /i2,3 of 112 and /is with 

(10.20) flfl{{xi,X2),ix[,x'2) : \ri{xi,x[) - r2ix2,x'2)\ > e] < e 
and 

(10.21) fif^{{x2,X3),{x2,x'3) : \r2{x2,X2) -r3(x3,X3)| >6}<6. 

Introduce the transition kernel K2^3 from X2 to defined by 

(10.22) /i2,3(d(a;2, xs)) = /i2(da;2)-ftr2,3(a;2, dxs). 

which exists since X2 and X3 are Polish. 

Using this kernel, define a coupling /ii^3 of /xi and fis by 



(10.23) /ii,3(d(xi,X3)) := / /ii,2(d(xi, X2))K2,3(a;2, dx3). 

Then 
(10.24) 

/if^3{(xi,X3), (a;'i,4) : \ri{xi,x[) -r3(a;3,4)| > £ + 5} 



X2xX|xX| 



/il,2(d(xi,X2))/il,2(d(x'i,X2))K2,3(2;2,dX3)K2,3(2;2,dX3) 

l{\ri{xi,x[) -r3(2;3,X3)| > e + 6} 



< / /il,2(d(xi,X2))/il,2(d(x'i,X2))K2,3(2;2,dX3)i^2,3(2;2,dX3) 



{l{\ri{xi,x[) - r2(x2,X2)| > e} + l{|r2(x2,X2) - r3(x3,X3)| > 6}) 
= flf^{{xi,X2), (xi,X2) : \ri{xi,x[) - r2(x2,X2)| > e} 

+ f^2^{{x2,X3),{x2,x'3) ■■ \r2{x2,X2) - r3{x3,x'3)\ > 6} 

<e + 5 

which yields (iEur('^i, < e + 5. □ 
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The Gromov-Wasserstein and the modified Eurandom metric. The 

topology of weak convergence for probabiUty measures on a fixed metric 
space {Z, r) is generated not only by the Prohorov metric, but also by 

(10.25) (i^'''^)(/ii,/X2) := iiif / /2(d(x,x')) {r{x,x') A l), 



M JZxZ 

where the infimum is over all couplings jl of /ii and ^2- This is a version 
of the Wasserstein metric (see, for example, [Rac91j ). If we rely on the 
Wasserstein rather than the Prohorov metric, this results in two further 
metrics: in the Gromov-Wasserstein metric, i.e., 

(10.26) dGw{X,y)-= inf d[^''''\iipx)*fix,M*f^Y), 

where the infimum is over all isometric embeddings from supp(/xx) and 
supp(/xy) into a common metric Z and in the modified Eurandom metric 

d'^^,{x,y) := 

(10.27) f 

inf / ji{(l{x,y))ji{d.{x ,y')){\rx{x,x) -rY{y,y')\M), 

where the infimum is over all couplings of ^.x and /iy. 

Remark 10.4. An L^-version of dew on the set of compact metric measure 
spaces is already used in |Stu06] . It turned out that the metric is complete 
and the generated topology is separable. □ 

Altogether, we might ask if we could achieve similar bounds to those 
given in Lemma [10. 3 1 by exchanging the Gromov-Prohorov with the Gromov- 
Wasserstein metric and the Eurandom with the modified Eurandom metric. 

Proposition 10.5. The distances dcw and d'-^^^^ define metrics onM. They 
all generate the Gromov-Prohorov topology. Bounds that relate these two 
metrics with dcPr cind dsur o-fe for X.,y ^ M, 

(10.28) (dGPr(^, 3^))' < dGV^{X, y) < dGPr(A', y) 

and 

(10.29) {dEnri^, y)? < d'E^,{X, y) < dEur{X, y) 

Consequently, the Gromov-Wasserstein metric is complete. 



Proof. The fact that c^gw and d'^^^ define metrics on M is proved analogously 
as for the Gromov-Prohorov and the Eurandom metric. The Prohorov and 
the version of the Wasserstein metric used in (jl0.26p and (|10.27p on fixed 
metric spaces can be bounded uniformly (see, for example. Theorem 3 in 
[GS02| ). This immediately carries over to the present case. □ 



WEAK CONVERGENCE OF RANDOM METRIC MEASURE SPACES 



35 



Appendix A. Additional facts on Gromov-Hausdorff 

CONVERGENCE 

Recall the notion of the Gromov-Hausdorff distance on the space He of 
isometry classes of compact metric spaces given in (I5.5P . We give a statement 
concerning convergence in the Gromov-Hausdorff metric which is analogous 
to Lemma 15.81 for Gromov-Prohorov convergence. 

Lemma A.l. Let {X,rx), {Xi,rxi), {X2,rx2), ■■■ be in\c- Then 

dGH(X„,X)"=^0 

if and only if there is a compact metric space (Z, rz) and isometric embed- 
dings if>, ipi, Lp2, ■■■ of {X,r), (Xi,rxi), {X2,rx2), respectively, into 
{Z, rz) such that 

(A.l) dif''^-)((^„(X„),v.(X))"=^0. 

Proof. The "if" -direction is clear. So we come immediately to the "only if" 
direction. If dGu{Xn, X) ".z^ q, then by (|5.8p we find correspondences Rn 
between X and Xn such that dis(i?„) ^ 0. Using these and Xq := X, we 
define recursively metrics r^^ on Z„ := |Jfc=o"'^'s- ^'^^^t, set Zi := Xq U Xi 
and rzi '■= r^^ (recall Remark 15. 5p . In the n^^ step, we are given a metric 
on Zn- Consider the canonical isometric embedding ip from X to Zn and 
define the relation Rn ^ Zn x Xn+i by 

(A.2) Rn+i := {{z,x) eZnX Xn+1 : ((^"^(z),x) G Rn+l}, 

Ft 

and set rz„_f_i '■= ■ ^^^^ procedure we end up with a metric rz on 
Z := U^o isometric embeddings ipo, ^pi,..- between Xq, Xi, ... and 

Z, respectively, such that 

(A.3) dif'''^)((^„(X„),v9(X)) = ^dis(i?„) ™ 0. 

W.l.o.g. we can assume that Z is complete. Otherwise we just embed ev- 
erything into the completion of Z. To verify compactness of {Z,rz) it is 
therefore sufficient to show that Z is totally bounded (see, for example, 
Theorem 1.6.5 in (BBIOlj ). For that purpose fix e > 0, and let n € N. Since 
X is compact, we can choose a finite e/2-net S in X. Then for all x £ Z 
with rz{x,X) < e/2 there exists x' £ S such that rz{x,x') < e. Moreover, 
dii{^n{Xn),^{X)^ < e, for all but finitely many n € N. For the remaining 
iPn{Xn) choose finite e-nets and denote their union by S. In this way, SU S 
is a finite set, and {i3e(s) : s G S* U 5} is a covering of Z. □ 
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